Effect of Cost Function Complexity and Dimensionality on Newton’s Method Convergence Rate

نویسنده

  • Scott Reid
چکیده

Today, Newton’s Method is commonly used in machine learning and optimization to locate local minima of cost functions. Because it is such a general tool, the cost functions that is implemented on can range widely in terms of both their dimensionality and their complexity. However, little is understood about how Newton’s Method performs on cost functions with different dimensionalities and complexities. In the simplest case, where the cost function is a paraboloid, one can show that Newton’s Method converges in a single iteration to the minimum. We can increase the complexity of our cost functions by considering perturbative deviations from the paraboloid cost function. Intuitively, one should expect that as we increase the cost function complexity, the convergence rate will decline. However, as of now, there is no simple way to parametrize the complexity of a cost function, and thus, little is known about the way that cost function complexity affects the convergence of Newton’s Method. In this paper, I introduce a method of generating perturbed cost functions which allow us to systematically deviate from the simple paraboloid case. These perturbed cost functions are parametrized by a parameter N , which describes the number density of gaussian pertubation sites and is thus related to the complexity of the generated cost functions. By empirically fitting a model to convergence rate data, I find that the convergence rate depends strongly on both N and D (the dimensionality of the cost function). This dependence on N and D leads me to the hypothesis that the convergence rate of Newton’s Method depends directly on the mean spacing between pertubations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Economic Production Quantity Models with Shortage, Price and Stock-Dependent Demand for Deteriorating Items

This investigation presents an economic production quantity (EPQ) model for deteriorating items with stock-dependent demand and shortages. It is assumed that a constant fraction of the on-hand inventory deteriorates and demand rate depends upon the amount of the stock level. Expression for various optimal indices as well as cost analysis are provided. By taking numerical illustration, sensitivi...

متن کامل

A Six-order Modification of Newton’s Method For Solving Nonlinear Equations

A new six order variant of Newton’s method has been developed by extending a third order method of Ozban [14] and its convergence properties have been discussed. The order of convergence of the proposed method is six. Starting with a suitably chosen x0, the method generates a sequence of iterates converging to the root. The convergence analysis is provided to establish its sixth order of conver...

متن کامل

Newton’s Method for Optimization in Jordan Algebras

We consider a convex optimization problem on linearly constrained cones in a Euclidean Jordan algebra. The cost function consists of a quadratic cost term plus a penalty function. A damped Newton algorithm is proposed for minimization. Quadratic convergence to the global minimum is shown using an explicit step-size selection.

متن کامل

Higher order Matching Pursuit for Low Rank Tensor Learning

Low rank tensor learning, such as tensor completion and multilinear multitask learning, has received much attention in recent years. In this paper, we propose higher order matching pursuit for low rank tensor learning problems with a convex or a nonconvex cost function, which is a generalization of the matching pursuit type methods. At each iteration, the main cost of the proposed methods is on...

متن کامل

A Distributed Newton Approach for Joint Multi-Hop Routing and Flow Control: Theory and Algorithm

The fast growing scale and heterogeneity of current communication networks necessitate the design of distributed cross-layer optimization algorithms. So far, the standard approach of distributed cross-layer design is based on dual decomposition and the subgradient algorithm, which is a first-order method that has a slow convergence rate. In this paper, we focus on solving a joint multi-path rou...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017